Thermodynamic cost of acquiring information 
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Connections between information theory and thermodynamics have proven to be very useful to 
establish bounding limits for physical processes. Ideas such as Landauer's erasure principle and 
information assisted work extraction have greatly contributed not only to enlarge our understand- 
ing about the fundamental limits imposed by nature, but also to enlighten the path for practical 
implementations of information processing devices. The intricate information-thermodynamics re- 
lation also entails a fundamental limit on parameter estimation, establishing a thermodynamic cost 
for information acquisition. More specifically, we show that the amount of information that can 
be encoded in a physical system (and consequently extracted) by means of any process is limited 
by the dissipated work during the implementation of the process. This includes a thermodynamic 
trade-off for information acquisition. Likewise, any information acquisition process is ultimately 
limited by the second law of thermodynamics. Such a trade-off may find applications in several 
areas of knowledge, since parameter estimation lies in the building basis of all natural sciences and 
several technological applications, such as metrology. 
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Information theory first met thermodynamics when 
Maxwell introduced his famous Demon pQ. This rela- 
tion became clear with Brillouin's treatment of the in- 
formation entropy (due to Shannon) and the thermody- 
namic entropy (due to Boltzmann) on the same footing 
[2]. Many advances linking these two apparently dis- 
tinct areas have been achieved since then, with one of 
the most remarkable being ascribed to Landauer's era- 
sure principle [3 j. This principle, introduced as an ef- 
fectively way to exorcize Maxwell's Demon, states that 
erasure of information is a logically irreversible process 
that must dissipate energy. More recently, developments 
in this directions include theoretical and experimental in- 
vestigations of Landauer's principle and its consequences 
[U [5] , work extraction by feedback control of microscopic 
systems [6HTU] , and links between the second law of ther- 
modynamics and two fundamental quantum mechanical 
principles, i.e., the wave-function collapse [IT] and the 
uncertainty relation [12]. Here, we introduce a thermody- 
namic trade-off for information acquisition, which relates 
the uncertainty of the information acquired in a param- 
eter estimation process with the dissipated work by the 
encoding process. This trade-off relation is obtained by 
a formal connection between an elusive quantity from es- 
timation theory, named Fisher information [T3HT6] , and 
the Jarzynski equality [T7] . 

Natural sciences are based on experimental and phe- 
nomenological facts. Parameter estimation protocols 
have a central role to the observation of new phenomena 
or to validate some theoretical prediction. Suppose, we 
want to determine the value of some parameter, let us say 
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p. This task can be accomplished, generally, by employ- 
ing a probe, pr- We will assume that the probe state 
is initially in thermal equilibrium at absolute tempera- 
ture T, so pT is the the canonical equilibrium (Gibbs) 
state [15] . In order to extract some information about 
the parameter (/?, the probe could be prepared (through 
a process Y) in a suitable blank state represented by pq. 
Then the probe is transformed by a process in or- 
der to encode information about the parameter on the 
probe state p^. In general, in real world applications, 
these operations (probe state preparation and encoding 
of information) are logically irreversible and therefore, 
must have an energetic cost. The effectiveness of the esti- 
mation (metrology) process depends on how information 
is encoded in the probe system. This encoding opera- 
tion consumes some work from a thermodynamic point 
of view. An estimation of the parameter ip can be ob- 
tained by a suitable read-out of the encoded probe system 
p^. The aforementioned protocol (and also outlined in 
Fig. 1(a)) abstractly summarizes the operation of almost 
all high-precision measurement devices. Employing this 
general framework, we show that the uncertainty (the 
mean square root error) Acp of an estimation process is 
limited by a general physical principle 

A$-Al v >'f, (1) 

where ks is Boltzmann's constant and the thermody- 
namic trade-off for information acquisition is defined as 
the mean dissipated work (Wd) at a given temperature 
T as A$ = (Wd)/T and the relative acquired informa- 
tion as AT^ = (A(/?) 2 jti 1 . 5^ is a quantity describing the 
accuracy of the encoding process. Roughly speaking, 5^ 
is the precision of the experimental device used to imple- 
ment A^ (the minimum scale for (/?, see Fig. 1(b) and 
the Supplementary Information). The symbol (• • •) rep- 
resents the mean value with respect to an ensemble of 
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Figure 1. General sketch of a parameter estimation process, 
(a) The estimation process works as follows: A probe system 
initially in the thermal equilibrium, pr, at a given temper- 
ature T, is prepared in a suitable blank state po (through 
a process T). Then, acts on the probe in order to en- 
code some information about the desired parameter cp. The 
thermodynamic cost of the probe state preparation and en- 
coding of information is A$. The final step is the read-out of 
the encoded probe p<p. This measurement process results in 
a probability distribution, p^, as function of (p. After some 
statistical manipulation of this measured distribution, an es- 
timation for the value of cp is obtained with the mean square 
root error Acp. (b) Pictorial representation of an imperfect 
encoding of information. The probe state is represented as a 
pointer and the encoding process A^ has a (minimum) finite 
precision S v . 



measurements. The physical quantities appearing in Eq. 
([!]) are highly process dependent and must be carefully 
defined in each physical set-up. We proceed by analyzing 
the physical meaning of Eq. ([!]) and discussing some of 
its implications, postponing its derivation. 

The work consumed in the parameter estimation pro- 
cess could ultimately be attributed to the logical irre- 
versibility of information encoding. The first step for any 
estimation protocol is to prepare the probe. If we employ 
an out of equilibrium probe, we have to erase the thermal 
state to prepare the probe in a suitable blank state and it 
has some energetic cost (from the Landauer's principle). 
The second step, i.e., the encoding of information in the 
probe state (A^), in a realistic apparatus is not perfect 
and must therefore also dissipate some energy. In order 
words, the finite precision 5^ of the encoding operation 
implies logical irreversibility and, as a consequence, work 
dissipation. For the sake of clarity, let us discuss these 
issues in two physical contexts. 

First, we consider a simple classical thermodynamic 
system. In this classical setting all the quantities ap- 
pearing in Eq. ([TJ are naturally defined. Nevertheless, 



our results can be applied to both (out of equilibrium) 
classical and quantum systems. Let us suppose that our 
apparatus is composed of a gas confined within a cylindri- 
cal chamber in which the upper base is made of a movable 
piston with some amount of matter (M) placed over it. 
The gas is our probe system and the position of the mov- 
able piston indicates the probe state. Considering cp as 
the equilibrium position of the piston, we can encode in- 
formation on this system introducing (or removing) some 
amount of matter over the piston. The information is 
encoded by the displacement cp of the piston from its 
equilibrium position cp . Considering the parameter es- 
timation processes as described in Fig. 1(a), we have 
the following steps: (i) Initially, the system is in ther- 
mal equilibrium at temperature T, being described by 
the state pt, corresponding to piston position cp (with 
some amount of mass M over the piston). In this situ- 
ation, all the forces acting on the piston are in equilib- 
rium and its position is fixed (except for thermal fluctu- 
ations). In this context, the equilibrium state pr could 
be a suitable probe (po = Pt), so the probe preparation 
process (T of Fig. 1(a)) does not dissipate work, (ii) 
By removing (in an adiabatic or in a non-adiabatic way) 
some amount of mass (AM) over the piston the informa- 
tion can encoded in the probe. Due to the unbalance of 
forces, this operation drives the piston to a new equilib- 
rium position described by the state p^, corresponding to 
position (p, thus encoding the information in the probe 
state. During such implementation a certain amount of 
work must be employed and part of it may be dissipated 
into the environment. In order to get a good estimation 
of the encoded information, this protocol should be re- 
peated N times or N identical copies of such a system 
should be employed. Each realization of the protocol dis- 
sipates a work w l D . The mean dissipated work is given 
by (Wd) = J2p( w d) w di w ith P( w b) which is the 
measured probability distribution for the whole ensemble 
of realizations, (iii) Measuring the piston position in all 
realizations, we obtain an estimation cp = (cp) db Acp for 
the parameter, where (cp) = N~ x J2p((fi)tpi is the mean 
value of the piston displacement, p(cpi) is the observed 
probability distribution and Acp is the square root devi- 
ation. 

In the example explored above the work is dissipated 
in step (ii). When we remove a certain amount AM of 
mass from the top of the piston, the gas expands until 
a new equilibrium position is reached. The amount of 
removed mass determines how much the piston position 
changes and, ultimately, it will also determine how much 
work will be dissipated. If the whole mass is removed in 
just one shot, there will be a huge amount of dissipated 
work, since the gas will expand from the initial state 
(corresponding to pt) to the final one (corresponding to 
p<p), through a sudden and irreversible path [19]. The 
amount of dissipated work is given by the second law of 
thermodynamics as (Wd) — (W) — AF, where AF is 
the difference in free energy between the final and the 
initial states. In this case, the final position of the piston 
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(the encoded information) will deviate from the predicted 
ideal non-dissipative one. This example clearly shows 
that some information is lost in the encoding process due 
to work dissipation caused by finite changes in the system 
(irreversibility) . 

To minimize the information loss and, consequently, to 
improve the protocol precision, we have to diminish dissi- 
pation as much as possible. This can be accomplished by 
removing the mass in small portions, with the limit be- 
ing the idealized reversible process, for which (Wd) = 0. 
In this case, Eq. ^ seems to be flawed, but a deeper 
analysis reveals that this is not the case. For the imple- 
mentation of a reversible process, we must take the limit 
AM — ^ (the process must be implemented in a quasi- 
static manner). But, this limit implies 5^ — » 0, since 5^ 
is the minimum step size that the piston is able to move, 
i.e., the minimum change in the system (see below and 
the Supplementary Information for formal details). In 
the limit of reversible processes we can read-out all the 
information encoded in the probe. On the other hand, 
in the real word the "scale" of the encoding apparatus is 
finite. In this case, the minimum amount of mass that 
can be removed is finite, this also introduces a minimum 
step size for the piston position, 5^ > 0. This inevitably 
leads to information loss in the encoding process due to 
work dissipation. In fact, any realistic encoding appara- 
tus with a finite precision (scale) is irreversible, therefore 
the apparatus must dissipate energy, introducing uncer- 
tainty in the parameter estimation as ultimately bounded 
by Eq. ([!]). Although, we have explored a specific exam- 
ple, this statement is independent of the physical system, 
just like the fact that irreversible processes (associated 
with finite changes in the system) must dissipate work 
and increase entropy. 

Two other important limits are the zero and infinity 
temperature. Regarding, as discussed early, that all real 
processes are irreversible and information is inevitably 
lost, we have (Wd) > 0, implying that 5<p > 0. Let us 
assume that both dissipated work (Wd) and the encoding 
accuracy 5<p are constants with respect to the tempera- 
ture. When T -> oo Eq. ^ leads to (A(p) 2 oo. The 
observer cannot obtain any information encoded by the 
process A^. Actually, we cannot encode any information 
in this limit due to the infinite amplitude of thermal fluc- 
tuations, which wash out all the information, no matter 
how precise the encode process is. In the opposite limit, 
T — ^ 0, we have (A(p) 2 > 0. For classical systems, this 
is a valid limit and the inequality could be, in principle, 
saturated. However, due to the third law of thermody- 
namics, it is not allowed for quantum systems to reach 
this limit. (A(p) 2 is always greater than zero due to quan- 
tum fluctuations. 

The bound presented in Eq. ([!]) also holds for quan- 
tum strategies for parameter estimation, employing out 
of equilibrium probes. Now, let us consider a standard 
interferometric strategy to estimate a phase shift between 
two states. This task can be accomplished observing the 
probability for the measurement of the probe in a suit- 



able basis. We are going to label the two states by |0) 
and |1). A suitable probe in this case is a balanced su- 
perposition, |Vo> = (|0) + |1» /V2 (p = |^o) W>oD- The 
probe state is out of equilibrium and some work has to be 
dissipated to prepare it through the process T depicted in 
Fig. 1(a). This process could be a suitable post-selected 
projective measurement on the thermal equilibrium state 
pT- This non-unitary operation erasures information and 
dissipates some energy according to Landauer's princi- 
ple. The encoding of information could be employed by 
a phase shifter, as for example: U(<p) = e^' 1 ^ 1 '. In 
a real interferometer the minimum step size 5^ to en- 
code the phase cp on the probe state is finite. Therefore, 
we have an imperfect encode (as pictorially described in 
Fig. 1(b)) and in an ensemble of realizations, the evolu- 
tion is not unitary, described by a trace preserving map, 
which is irreversible. This finite accuracy of the phase 
shifter (5^ > 0) implies information loss and work con- 
sumption in the encoding process A^. In fact, the probe 
state preparation is also a non-ideal process introduc- 
ing another source of dissipated work. The bound for 
information-acquisition in the out of equilibrium quan- 
tum context is also given by Eq. ([!]). Besides the funda- 
mental limitation introduced for a finite accuracy of the 
phase shifter, there is also the decoherence phenomenon 
that implies information loss and work consumption, re- 
stricting the quantum-interferometer precision. 

Discussions. We introduced a physical principle that 
bounds information acquisition, Eq. 0, derived from 
the Jarzynski equality [17] and the Cramer- Rao [20j [21] 
relation. This is a general result, applicable to classical 
or quantum contexts, stating that the amount of infor- 
mation that can be encoded by means of a given pro- 
cess is limited by the dissipated work (due to logical ir- 
reversibility) during the implementation of the estima- 
tion process. This conclusion reveals a deep connection 
between Metrology and Thermodynamics, implying that 
the physical limit for the precision of a parameter estima- 
tion process (which is equivalent to encoding and decod- 
ing information) given by Thermodynamics. Moreover, 
the lower bound on the uncertainty about the estimation 
of a given parameter is zero only in the thermodynamic 
limit of reversible processes (second law). 

The inequality could be conceived as a counterpart 
of Landauer's principle, as both of them are assertions 
about the work cost of information (acquisition or erase). 
Furthermore, it would be interesting to investigate the re- 
lation of the results herein with generalized uncertainty 
relations. At this point, it is reasonable to presume that 
probably the basic principles of quantum mechanics it- 
self are probably subtly connected to the second law of 
thermodynamics [I2j [22j [23] in an informational point of 
view. 

From the point of view of the experimental verification 
of Eq. (1), it is important to precisely establish the sys- 
tem, in order to define all the quantities involved, such 
as the work employed in the process and how the infor- 
mation is encoded and read-out. 
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Discussing the fundamentals of physics, Planck has ar- 
gued that the number of dimensional fundamental con- 
stants in nature should be equal to four [24]: The New- 
tonian gravitational constant G, the speed of light c, 
Planck's and Boltzmann's constants h and respec- 
tively. The authors of Ref. [25] concluded that this num- 
ber should be two, chosen between G, c and ft, having 
discarded Boltzmann's constant for being a conversion 
factor between temperature and energy. In Ref. [26] , the 
viewpoint that Planck's constant is superfluous was ad- 
vocated and ks was also discarded for the same reason 
given in [25] . If we define temperature as twice the mean 
value of the energy stored in each degree of freedom of a 
system in thermal equilibrium, T = 2(Fq), Kb turns out 
to be a dimensionless quantity equal to one and Eq. ([!]) 
becomes 



(Wd) 



> 1, 



(2) 



which says that the precision of the information acquired 
in a parameter estimation process is limited by the mean 
dissipated work per degree of freedom of the encoding 
system. On a more practical ground, inequality ([!]) is 
quite meaningful for technological applications on metr- 
logy. 

Methods. Fisher Information and Thermodynamics. 
Here, we outline the derivation of Eq. ([!]) postponing 
the details to the Supplementary Information. Consider 
again the general estimation process described in Fig. 
1(a). To inspect how a given unbiased estimator for the 
parameter cp is close to the real encoded information, we 
can use the so-called Cramer- Rao bound [20] [21] 



(3) 



where T is the Fisher Information, usually defined as 

T = J dxp^ (x) { ^ ln Q^ X ^ • P(p(x) is the probability dis- 
tribution for the best read-out strategy of the encoded 
probe and it contains the information about cp. For our 
proposal, it will be interesting to express the Fisher in- 
formation in terms of a relative entropy as (see Supple- 
mentary Information) p~5j H6] 



S{p™\\&*) 



(4) 



with being the read-out probability distribution ob- 
tained in a real (irreversible) experiment, p l £ being the 
read-out probability distribution of an ideal (reversible) 
parameter estimation protocol, and 5<p is the accuracy of 
the process. 

Next, we relate the Fisher information with informa- 
tion loss and dissipation through Jarzynski equality [17] . 
Considering that a system is driven, through the injection 



exp • 



i, 



(5) 



of work W by an external agent, from the initial equilib- 
rium state to some final one, Jarzynski proved that [T7] 

(W - AF) \ 
k B T J 

where the mean is computed over the ensemble and AF is 
the free energy difference between the final and the initial 
system's states. T is the temperature of the initial equi- 
librium state. Very recently, arguing that the Jarzynski 
equality could be obtained from an information-theoretic 
relation, Vedral showed that [27 



exp 



(VV - AF) 



})-<■ 



(6) 



where (T XjX ^ is the mutual information between the 
read-out distribution for final probe state (where param- 
eter cp is encoded, p^) and the distribution for the initial 
thermal state (pr)- % is some parameter characterizing 
the distribution. 

In addition to the above results, we can show (see Sup- 
plementary Information) that 



2 ' 



(7) 



where T is the Fisher information (classical or quantum) 
for the encoded state p v and 

l\ <1 (8) 



is the relative accuracy of the estimation process. 



In 



this equation the supper-index re refers to the real irre- 
versible process, whereas id refers to the ideal reversible 
process. The approximation in Eq. ([8| means that the 
error in the measurement must be much smaller than the 
parameter being measuring. This is quite a reasonable 
assumption since an error of the same order of the pa- 
rameter would render meaningless the entire parameter 
estimation process. 

From Eq. ([6| we obtain (see Supplementary Informa- 
tion) 



(VV) - AF 
k B T ' 



(9) 



Combining everything together, from Eqs. (J3|, and 
([9]), we show that 



(A^) 2 > 
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(10) 



which is our main result expressed in Eq. Q as a trade- 
off relation. 
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SUPPLEMENTARY INFORMATION 



Appendix A: Dissipation- Information Inequality 



In this section we will obtain relation (9) of the main text. Remembering that we are interested in the general 
description of a parameter estimation process described in Fig. 1(a) of the main text. A key quantity in our derivation 
is the mutual information I (po : p^) between the outcomes of the state-preparation (r^) and the final read-out (Hk) 
measurements. In the quantum scenario the joint probability distribution for the outcomes of two single positive 

operator valued measurements is Po<p(j,k) = Tr II^ (i^jPtTJ^J H\, (J2jFj = I and = I) . The mutual 

information / (po : p^) may be written in terms of the information density 

^ (j, k) = (j, k) - In [p (j) p^ (k)} , (Al) 

as I (po ' pip) = (Zo-.ip (j,k)), with p^ (k) = J2jPoip(j,k) being the marginal probability for the encoded state, with 
a similar definition for po (j). This mutual information quantify the correlations between the second and the first 
measurements. It is the difference between the information contained in the joint probability and the information 
contained in the product of the marginal distributions. 

We may consider two distinct processes, the ideal (reversible) and the real (irreversible). The difference between 
then is that the second includes all the sources of errors and dissipation, while the first one is considered error-free 
(dissipation- free). The difference in the densities of the mutual information between both processes is given by 



r f v (j,k)-i ( 



l%(j,k)=\n V l% (j,k)- 
Po% (j> k) 



In- 



In [p r e (j) p™ (k)] - lnpgj, (j, k) + In [pf (j) p% (k)] 
P r o e U) 



(A2) 

Ph%{hk) ^pjfU) 

where the labels re and id means the real and the ideal processes, respectively. Multiplying now by (j, k) and 
summing over j and k we obtain 



(r %(j,k)) r 



S(p 



s(p r o e \\ P if) 



U,k) 



In 



Pl%{hk) ^^(j) ^P r v e (k) 



Pl% U, k) 



In- 



Ph d U) pi d (k) 



s( P r ;\K d ). 



(A3) 
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In this last expression 



S(p||«) = X>(a)ln^ (A4) 
Q \ a ) 



is the relative entropy between distributions p and q. (-) re means that the average is taken over the real probability 
distribution. 

In Ref. [27] of the main text it was shown that 

<W,*)>„ = ^, (A5) 

with (Wj)) being the mean dissipated work during the implementation of the (real) encoding process, T the temper- 
ature and the Boltzmann constant. 
In the next section we show that 

s{pi e M%) «f^(p; rf ), (A6) 

with 5^ being the accuracy of the implementation of the process and T (p 1 ^) the Fisher information of the final ideal 
distribution (see main text and the next section of this Supplementary Information for the physical significance and 
precise mathematical definition of these quantities). 
Putting all these results together we obtain 

k B T "> 

with 

n = {xi% u, k)) re - s( P r e | \pt?) - s( P r ; \ \ P %). (A7) 

As a consequence of the inequality in Cramer-Rao relation (Refs. [20, 21 of the main text) our main result, Eq. 
(1) of the main text, is also an inequality. Therefore, to prove it all we have to do is to prove that £1 > for all 
distributions. Thus 

« = $>o£ U, k) [ln< (j, k) - lap*? (j) - lnp% (k)] 
-Y,P r o e U) [lnpS e 0') - ln^ d 0')] 

3 

-E^ e w[ ln ^ e ( fc )- ln <( & )] 

k 

= X>5£ & fc ) ln < & *) + H (p~) + H (p™) , (A8) 

where H (p) = — ^Z k p(k) \np(k) is the Shannon entropy. The first term of this last expression measures our lack of 
knowledge about the ideal probability distribution. In fact, ]np}f is a measure o the information contained in the 
ideal distribution (the one we expected to happen). However, events occur accordingly with the real distribution p^ 
(due to experimental non-idealities, as for example: dissipation). This cause a loss of information. It is not difficult 
to show that 

H (pg£) < - X>5° U, k) ln< (?. k) , (A9) 

3,k 

for every probability distribution. In order to have Q > the following relation must be obeyed 

H (pg e ) + H (p™) >-E^ {3, k) lnpft (j, k)>H (p^ ) , (A10) 

which is the well know superaditivity relation for the entropy. Thus, we are lead to conclude that 



k B T 2 

which proves Eq. (9). 



-F(p; e )>0, (All) 
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Appendix B: Fisher information and relative entropy 



One way to define the Fisher information of a probability distribution p^ is through the calculation of the relative 
entropy between p^ and p^+s^ yielded by a small shift 5<p in the parameter tp. 5^ is then a measure of the accuracy 
of the process, i.e., the minimum error compatible with the specific process under consideration. It quantifies how 
much the real probability distribution diverges from the ideal one. We can write the relative entropy S(p lfi \\p ip ^s (p ) as 
(see Refs. [I5j [16] of the main text) 



3,k 



P<p(j,k) 



By using a Taylor expansion about 5^ we can write 



d\np^ ^ S 2 d 2 lnp y 

dip 2 dip 2 



0{5%). 



Now, substituting (B2) in (Bl), the first order term yields 

>P<p(j,k) dp^jj.k) _ 



6 E 



3,k 



p<p(j,k) dip 



'dip 



^2p<ptii k ) = ° 



3,k 



while the second one leads us to 

d 2 \ripp 



^^Ptp&k)- 



3,k 



dip 2 



4e 



P<p(J,k) d 2 Pv(j,k) 



2 ^PtpUik) dip 2 

3i K 



5 2 

-f52p<p(J,k) 



3,k 



1 dPy{j, k ) 
p<p(j,k) dip 



Since the probability distribution is normalized, 
and keeping terms up to second order we obtain 



3,k 



d 2 p<p 
dip 2 



= 0, 



j,k ^ 



where J-{p v ) is the Fisher information of p v . This approximation implies that we must have 



<1, 



(Bl) 



(B2) 



(B3) 



(B4) 



(B5) 



(B6) 



(B7) 



meaning that the error in the measurement is much smaller than the parameter we are measuring. This is a quite 
reasonable assumption since an error of the same order of the parameter would turn meaningless the measurement 
process. 

In order to prove Eq. (A6) let us introduce the Jeffreys 7 divergence (Ref. [15] of the manuscript) 

J (Pv , Pip+Sy ) =S{p<p\ IPv+Sy ) + S {ptp+8,, I \P* ) 

= Ei, fc M, k ) ~ P<p+6 v (7, k)) In • 



(B8) 



Since ( B8 ) is symmetrical, we may rewrite it as 



(B9) 
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where Ap^+s = p^+s — p^. Due to the fact that 5^ is small, the distribution p^+s will be close from p^. Therefore 



In 



allowing us to write 



3,k 



P<pti,k) 



Plp(jl k ) 



5^ Afrp+5^ (j> fc) 
2fy ^ 



1 dp v (j,k) 
p v (j,k) dip 



From equation (|B8) we have S^+sJIp^,) = J(p v ,p v +sJ - S(p v \\p v +s v ), so 



(BIO) 



(Bll) 



(B12) 



Finally, making the identifications p^ = pgj, and = we obtain Eq. (A6), thus proving our main claim, 

Eq. (1) of the manuscript. 



